SEQUENCE LISTING 



(1) GENERAL. INFORMATION : 

(i) APPLICANT: Breece, Tim 

Hayenga , Kirk 
Rinderknecht , Ernst 
Vandl en , R i chard 
fansura, Daniel 

(ii) TITLE OF INVENTION: PROCESS FOR PRODUCING RELAXIN 

(iii) NUMBER OF SEQUEN^S : 40 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Mr. Walter H. Dreger 

(B) STREET: 4 Embarca^ero Center, Suite 3400 

(C) CITY: San Franc i sec 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94111 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS -DC 

(D) SOFTWARE: Patentln Release #1 . 0> 



Version #1.25 




(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/080,354 

(B) FILING DATE: 21-JUN-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Dreger, Walter H. 

(B) REGISTRATION NUMBER: 24,190 

(C) REFERENCE/DOCKET NUMBER: A-58117/WHD 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 781-1989 

(B) TELEFAX: (415) 398-3249 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



-2- 

v (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

>t Lys Lys Asn lie Ala Phe Leu Leu Lys Arg 
1 \ 5 10 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUfeNCE CHARACTERISTICS: 

(A) LfcNGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRA^DEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE\peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp Ser Trp Met Glu Glu Val lie Lys Leu Cys Gly Arg Glu Leu Val 
1 5 \ 10 15 

Arg Ala Gin lie Ala lie Cys Gly Met Ser Thr Trp Ser 
20 \ 25 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Lys Arg Lys Pro Thr Gly Tyr Gly Ser Arg Lys Lys Arg 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: 



peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

v Gln Leu Tyr Ser Ala Leu Ala Asn Lys Cys Cys His Val Gly Cys Thr 
5 10 15 

Lys\Arg Ser Leu Ala Arg Phe Cys 

1 20 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IJ&NGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRAhDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE\peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Lys Arg Lys Pro Thr GlyVyr Gly Ser Arg Lys Lys Arg 
1 5 \ 10 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids! 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Lys Lys Arg Thr Gly Tyr Gly Ser Arg Arg \rg Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asp Lys Lys Arg Thr Gly Tyr Gly Ser Arg Lys Lys Arg 
15 10 



-4- 



( 2 /^INFORMATION FOR SEQ ID NO: 8: 

SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 amino acids 
TYPE: amino acid 
:) STRANDEDNESS: single 
(1^ TOPOLOGY: linear 

(ii) MOLEC\LE TYPE: peptide 

(xi) SEQUENCES, DESCRIPTION: SEQ ID NO: 8: 

Lys Arg Lys Pr\ Thr Gly Tyr Gly Ser Arg Arg Arg Lys 
1 \5 10 

(2) INFORMATION FOR SEQ\lD NO : 9 : 

(i) SEQUENCE CHARACTi 

(A) LENGTH: 231 ba>se pairs 

(B) TYPE: nucleic afcid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .231 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9N 

ATG AAA AAG AAT ATC GCA TTT CTT CTT AAA CGG GAC TCA TGG ATG 
Met Lys Lys Asn lie Ala Phe Leu Leu Lys Arg Asp Ser Trp Met 
1 5 10 \ 15 



GAG 
Glu 



48 



GAA GTT ATT AAA TTA TGC GGC CGC GAA TTA GTT CGC G(SG CAG ATT GCC 
Glu Val lie Lys Leu Cys Gly Arg Glu Leu Val Arg Al\ Gin lie Ala 
20 25 \ 30 



96 



ATT TGC GGC ATG AGC ACC TGG AGC AAA AGG AAA CCC ACT Gto TAT GGT 
lie Cys Gly Met Ser Thr Trp Ser Lys Arg Lys Pro Thr Gl\ Tyr Gly 
35 40 45 



144 



TCT CGA AAA AAG AGA CAA CTC TAC AGT GCA TTG GCT AAT AAA Ti$T TGC 
Ser Arg Lys Lys Arg Gin Leu Tyr Ser Ala Leu Ala Asn Lys Cy\ Cys 
50 55 60 



192 



CAT GTT GGT TGT ACC AAA AGA TCT CTT GCT AGA TTT TGC 
His Val Gly Cys Thr Lys Arg Ser Leu Ala Arg Phe Cys 
65 70 75 



231 



(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Li) MOLECULE TYPE: protein 

(xi\ SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Lys Lys A^n lie Ala Phe Leu Leu Lys Arg Asp Ser Trp Met Glu 
1 \ 5 10 15 

Glu Val lie Lys Le\i Cys Gly Arg Glu Leu Val Arg Ala Gin lie Ala 
20 \ 25 30 

lie Cys Gly Met Ser Thr\Trp Ser Lys Arg Lys Pro Thr Gly Tyr Gly 
35 \ 40 45 

Ser Arg Lys Lys Arg Gin LeuVr Ser Ala Leu Ala Asn Lys Cys Cys 
50 55 \ 60 

His Val Gly Cys Thr Lys Arg Ser Deu Ala Arg Phe Cys 
65 70 \ 75 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 431.. 586 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GAATTCAACT TCTCCATACT TTGGATAAGG AAATACAGAC ATGAAAAATC TCATTQCTGA 60 

GTTGTTATTT AAGCTTGCCC AAAAAGAAGA AGAGTCGAAA GAACTGTGTG CGCAGGTApA 120 

AGCTTTGGAG ATTATCGTCA CTGCAATGCT TCGCAATATG GCGCAAAATG ACCAACAGCCN 180 

GTTGATTGAT CAGGTAGAGG GGGCGCTGTA CGAGGTAAAG CCCGATGCCA GCATTCCTGA \ 240 

CGACGATACG GAGCTGCTGC GCGATTACGT AAAGAAGTTA TTGAAGCATC CTCGTCAGTA \oO 

AAAAGTTAAT CTTTTCAACA GCTGTCATAA AGTTGTCACG GCCGAGACTT ATAGTCGCTT 36C 

TGTTTTTATT TTTTAATGTA TTTGTACGCA AGTTCACGTA AAAAGGGTAT CTAGAGGTTG 420 

AGGTGATTTT ATG AAA AAG AAT ATC GCA TTT CTT CTT GCA TCT ATG TTC 469 
Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe 
15 10 
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GTTXTTT TCT ATT GCT ACA AAT GCC TAT GCA GAC TCA TGG ATG GAG GAA 
Val Phe Ser lie Ala Thr Asn Ala Tyr Ala Asp Ser Trp Met Glu Glu 
l\ 20 25 



517 



GTT ATT P&A TTA TGC GGC CGC GAA TTG GTA CGC GCG CAA ATA GCG ATA 
Val lie Ly\ Leu Cys Gly Arg Glu Leu Val Arg Ala Gin lie Ala lie 
30 \ 35 40 45 



565 



TGC GGT ATG AGT ACA TGG AGT TGAAGAA 
Cys Gly Met Ser\Thr Trp Ser 
>0 



593 



(2) INFORMATION FOR 9^Q ID NO: 37; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH :\52 amino acids 

(B) TYPE: amW> acid 
(D) TOPOLOGY: Vine ar 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTIONS: SEQ ID NO: 37: 

Met Lys Lys Asn lie Ala Phe Leu Deu Ala Ser Met Phe Val Phe Ser 
1 5 \ 10 15 



lie Ala Thr Asn Ala Tyr Ala Asp Ser \rp Met Glu Glu Val lie Lys 



20 



25 



30 



Leu Cys Gly Arg Glu Leu Val Arg Ala Gli 
35 40 



Le Ala lie Cys Gly Met 
45 



Ser Thr Trp Ser 
50 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 438.. 1238 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GAATTCAACT TCTCCATACT TTGGATAAGG AAATACAGAC ATGAAAAATC TCATTGCTGA 



60 



-7- 

GTTGTTATTT AAGCTTGCCC AAAAAGAAGA AGAGTCGAAA GAACTGTGTG CGCAGGTAGA 120 

\GCTTTGGAG ATTATCGTCA CTGCAATGCT TCGCAATATG GCGCAAAATG ACCAACAGCG 180 

GTTCATTCAT CAGGTAGAGG GGGCGCTGTA CGAGGTAAAG CCCGATGCCA GCATTCCTGA 240 

CGACGAtACG GAGCTGCTGC GCGATTACGT AAAGAAGTTA TTGAAGCATC CTCGTCAGTA 300 

AAAAGTTAAt CTTTTCAACA GCTGTCATAA AGTTGTCACG GCCGAGACTT ATAGTCGCTT 360 

TGTTTTTATT T^TTAATGTA TTTGTAACTA GTACGCAAGT TCACGTAAAA AGGGTATCTA 420 

GAGGTTGAGG TGAT^TT ATG AAA AAG AAT ATC GCA TTT CTT CTT GCA TCT 470 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser 
15 10 



ATG TTC GTT TTT TCT ACT GCT ACA AAT GCC TAT GCA TCA GGC ACT ACA 
Met Phe Val Phe Ser Ile\Ala Thr Asn Ala Tyr Ala Ser Gly Thr Thr 
15 \ 20 25 



518 



AAT ACT GTG GCA GCA TAT AATNTTA ACT TGG AAA TCA ACT AAT TTC AAG 
Asn Thr Val Ala Ala Tyr Asn Deu Thr Trp Lys Ser Thr Asn Phe Lys 
30 3\ 40 



566 



ACA ATT TTG GAG TGG GAA CCC AAA CISC GTC AAT CAA GTC TAC ACT GTT 
Thr lie Leu Glu Trp Glu Pro Lys Pr<k Val Asn Gin Val Tyr Thr Val 
45 50 \ 55 



614 



CAA ATA AGC ACT AAG TCA GGA GAT TGG AAAS 
Gin lie Ser Thr Lys Ser Gly Asp Trp Lys 
60 65 



AAA TGC TTT TAC ACA 
>ys Cys Phe Tyr Thr 
75 



662 



ACA GAC ACA GAG TGT GAC CTC ACC GAC GAG ATT 
Thr Asp Thr Glu Cys Asp Leu Thr Asp Glu lie 
80 85 



AAG GAT GTG AAG 
Lys Asp Val Lys 
90 



710 



CAG ACG TAC TTG GCA CGG GTC TTC TCC TAC CCG GCA GG& AAT GTG GAG 
Gin Thr Tyr Leu Ala Arg Val Phe Ser Tyr Pro Ala GlyVsn Val Glu 
95 100 It 



758 



AGC ACC GGT TCT GCT GGG GAG CCT CTG TAT GAG AAC TCC CCA GAG TTC 
Ser Thr Gly Ser Ala Gly Glu Pro Leu Tyr Glu Asn Ser Pro G]\ Phe 
110 115 120 



806 



ACA CCT TAC CTG GAG ACA AAC CTC GGA CAG CCA ACA ATT CAG AGT TTj 
Thr Pro Tyr Leu Glu Thr Asn Leu Gly Gin Pro Thr lie Gin Ser Phe 
125 130 135 

GAA CAG GTG GGA ACA AAA GTG AAT GTG ACC GTA GAA GAT GAA CGG ACT 
Glu Gin Val Gly Thr Lys Val Asn Val Thr Val Glu Asp Glu Arg Thr 
140 145 150 155 



854 



TTA GTC AGA AGG AAC AAC ACT TTC CTA AGC CTC CGG GAT GTT TTT GGC 
Leu Val Arg Arg Asn Asn Thr Phe Leu Ser Leu Arg Asp Val Phe Gly 
160 165 170 



950 



\G GAC TTA ATT TAT ACA CTT TAT TAT TGG AAA 
L>s Asp Leu lie Tyr Thr Leu Tyr Tyr Trp Lys 
175 180 

AAG AAA ACA GCC AAA ACA AAC ACT AAT GAG TTT 
Lys Lys\Thr Ala Lys Thr Asn Thr Asn Glu Phe 
k90 195 

AAA GGA GAASAAC TAC TGT TTC AGT GTT CAA GCA 
Lys Gly Glu Asn Tyr Cys Phe Ser Val Gin Ala 
205 \ 210 

ACA GTT AAC CGG AXG AGT ACA GAC AGC CCG GTA 
Thr Val Asn Arg Lys\Ser Thr Asp Ser Pro Val 
220 X25 230 

GAG AAA GGC CAA TTC AGA\GAA ATA TTC TAC ATC 
Glu Lys Gly Gin Phe Arg (Slu lie Phe Tyr lie 
240 \ 245 

TTT 'GTG GTC ATC ATC CTT GTC >\TC ATC CTG GCT 
Phe Val Val lie He Leu Val IJfce He Leu Ala 
255 \260 



TCT TCA AGT TCA GGA 
Ser Ser Ser Ser Gly 
185 

TTG ATT GAT GTG GAT 
Leu He Asp Val Asp 
200 

GTG ATT CCC TCC CGA 
Val He Pro Ser Arg 
215 

GAG TGT ATG GGC CAG 
Glu Cys Met Gly Gin 
235 

ATT GGA GCT GTG GTA 
He Gly Ala Val Val 
250 



998 



1046 



1094 



1142 



1190 



ATA TCT CTA CAC TAAAATTCTC 
He Ser Leu His 
265 



12 



ATGTTTGACA G CTT AT CATC GATAAGCTTT 
AACGCAGTCA GGCACCGTGT ATGAAATCTA ACi 
TCACCCTGGA TGCTGTAGGC ATAGGCTTGG TTATGC^GGT 
ATATCGTCCA TTCCGACAGC ATCGCCAGTC ACTATGGCG^ 
TGATGCAATT TCTAT 



TTTATCACAG TTAAATTGCT 
CAT CGT CATC CTCGGCACCG 
ACTGCCGGGC CTCTTGCGGG 
GCTCCTAGCG C TAT ATG CGT 



1305 
1365 
1425 
1485 
1500 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Lys Lys Asn He Ala Phe Leu Leu Ala Ser Met Phe Val Phe Se\ 
15 10 15 

He Ala Thr Asn Ala Tyr Ala Ser Gly Thr Thr Asn Thr Val Ala Ala 
20 25 30 

Tyr Asn Leu Thr Trp Lys Ser Thr Asn Phe Lys Thr He Leu Glu Trp 
35 40 45 



Glu Pro Lys Pro Val Asn Gin Val Tyr Thr Val Gin He Ser Thr Lys 



50 
Gly Asp 

Asp Le\i Thr 

Arg Val Phe 



Gly Glu Pro 
115 

Thr Asn Leu 
130 

Lys Val Asn 
145 

Asn Thr Phe 



Thr Leu Tyr 

Thr Asn Thr 
195 

Cys Phe Ser 
210 

Ser Thr Asp 

225 

Arg Glu lie 
Leu Val lie 



Trp Lys Ser 
70 

Asp Glu lie 

85. 

Ser Tyr Pro 
^0 

LeuVTyr Glu 
Gly Gln\ro 



Val Thr Val 
150 

Leu Ser Leu 
165 

Tyr Trp Lys 
180 

Asn Glu Phe 



Val Gin Ala 



Ser Pro Val 
230 

Phe Tyr lie 
245 

lie Leu Ala 
260 



55 
Lys Cys 

Val Lys 

Ala Gly 

Asn Ser 
120 

Thr He 
135 

Asp 

Arg A* 

Ser Ser 

Leu He 
200 

Val He 
215 

Glu Cys 
He Gly 
He Ser 



Phe Tyr Thr 
75 



60 
Thr Asp 



Asp Val Lys 
90 

Asn Val Glu 
105 

Pro Glu Phe 



Gin Ser Phe 



Glu Arg Thr 
155 

Val Phe Gly 
170 

Se\r Ser Gly 
185 



Gin Thr 



Ser Thr 



Thr Pro 
125 

Glu Gin 
140 

Leu Val 
Lys Asp 
Lys Lys 



Pro Ser Arc 



Lys Gly 
205 

,Thr Val 
120 



Met Gly Gin 
235 

Ala Val Val 
250 

Leu His 
265 



Phe Val\ 



Thr Glu Cys 
80 

Tyr Leu Ala 
95 

Gly Ser Ala 
110 

Tyr Leu Glu 

Val Gly Thr 

Arg Arg Asn 
160 

Leu He Tyr 
175 

Thr Ala Lys 
190 

Glu Asn Tyr 
Asn Arg Lys 



Gly Gin Phe 
240 

Val He He 
255 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Lys Lys Asn He Ala Phe Leu Leu Arg Lys 
15 10 
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,(2) INFORMATION FOR SEQ ID NO: 13; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Lys Lys Asn lie Ala Phe Leu Leu Arg Arg 
1 \ 5 10 

(2) INFORMATION FO^SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: Pi amino acids 

(B) TYPE: amin\ acid 

(C) STRANDEDNESSn: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: pept: 
(xi) SEQUENCE DESCRIPTION: 



ID NO: 14: 



Met Lys Lys Asn lie Ala Phe LeuVLeu Lys Lys 
1 5 \ 10 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 5 . .42 

(D) OTHER INFORMATION: /note= "Complementary 

double- stranded binding between bases 5 ^nd 42 to 
SEQ ID NO: 16. " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CTAGAATTAT GAAAAAGAAT ATCGCATTTC TTCTTAAACG GG 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 



42 
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(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

i) MOLECULE TYPE: cDNA 



( ix)\ FEATURE : 

\(A) NAME/KEY: misc_f eature 
^B) LOCATION: 4. .41 

(DO OTHER INFORMATION: /note= "Complementary 

\ double -stranded binding between bases 4 and 41 to 
\ SEQ ID NO: 15." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AGTCCCGTTT AAGAAGAAAt GCGATATTCT TTTTCATAAT T 41 
(2) INFORMATION FOR SECKID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic\acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: lineal 

(ii) MOLECULE TYPE: cDNA Tfr—^ 



(ix) FEATURE: \ 

(A) NAME/KEY: misc_f eatureX 

(B) LOCATION: 5.. 42 \ 

(D) OTHER INFORMATION: /no te^V "Complementary 

double -stranded binding\between bases 5 and 42 to 
SEQ ID NO: 18." \ 

(ix) FEATURE: \ 

(A) NAME/KEY: CDS \ 

(B) LOCATION: 9. .41 \ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: \ 

CTAGAATT ATG TTC CCA GCT ATG CCT CTA TCT AGT AAA C£G G 42 
Met Phe Pro Ala Met Pro Leu Ser Ser Lys Ark 
1 5 10 \ 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



1 

(2) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Phe Pro Ala Met Pro Leu Ser Ser Lys Arg 
5 10 

FORMATION FOR SEQ ID NO: 18: 

SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 base pairs 
^(B) TYPE: nucleic acid 
;) STRANDEDNESS : both 
(iH TOPOLOGY: linear 




(ii) MOLEC 



TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION^ 4. .41 
(D) OTHER INFORMATION: /note= "Complementary 

double -stranded binding between bases 4 and 41 to 
SEQ ID Nol\7 . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 



AGTCCCGTTT ACTAGATAGA GGCATAGCTG GC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



KAAT T 




41 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 5 . . 64 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 5 . . 64 

(D) OTHER INFORMATION: /note= "Complementary 

double -stranded binding between bases 5 and 
SEQ ID NO: 21." 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CGCXCAG ATT GCC ATT TGC GGC ATG AGC ACC TGG AGC AAA AGG AAA CCC 
Ln lie Ala lie Cys Gly Met Ser Thr Trp Ser Lys Arg Lys Pro 
5 10 15 

ACT GGT T^T GGT TCT 
Thr Gly Tyr\£ly Ser 
20 



49 



64 



(2) INFORMATION K)R SEQ ID NO: 22: 

(i) SEQUENcW CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPEs: amino acid 
(D) TOPOEOGY: linear 

(ii) MOLECULE TYPEV protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



Gin He Ala He Cys Gly Met 
1 5 

Gly Tyr Gly Ser 
20 



, r Thr Trp Ser Lyg Arg Lys Pro Thr 
10 15 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 3. .62 

(D) OTHER INFORMATION: /note= "Complementary 

double -stranded binding between basics 3 and 62 to 
SEQ ID NO: 20." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CGAGAACCAT AACCAGTGGG TTTCCTTTTG CTCCAGGTGC TCATGCCGCA AA^GGCAATC 



60 



TG 



62 
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( 2 )\ INFORMATION FOR SEQ ID NO: 23: 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
L (C) STRANDEDNESS: both 

CD) TOPOLOGY: linear 

(ii) MOLkcULE TYPE: cDNA 

(ix) FEATUF 

(A) NAHE/KEY: misc_feature 

(B) LOCATION: 5 . . 50 

(D) OTHElK INFORMATION: /note= "Complementary 

double -stranded binding between bases 5 and 50 
with\EQ ID NO: 24." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GGCCACTCTG TGCGGTGCTG AACTGCTITGA CGCTCTGCAG TTTGTTTGCG 
(2) INFORMATION FOR SEQ ID NO:: 

(i) SEQUENCE CHARACTER I ST I CS> 

(A) LENGTH: 50 base pair* 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



50 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 6. .50 

(D) OTHER INFORMATION: /note= "Complementary 

double -stranded binding between^ases 6 and 50 
with SEQ ID NO: 23." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GTCACCGCAA ACAAACTGCA GAGCGTCAAC CAGTTCAGCA CCGCAAGAGT 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1. .55 

(D) OTHER INFORMATION: /note- "Complementary 

double- stranded binding between bases 1 and 55 
with SEQ ID NO: 26." 



(xi) Sequence description: seq id no:25: 

GGTCCCGAAA C^CTGTGCGG TGCTGAACTG GTTGACGCTC TGCAGTTTGT TTGCG 
(2) INFORMATION\FOR SEQ ID NO: 26: 

(i) sequence\haracteristics: 

(A) LENGTH,: 64 base pairs 

(B) TYPE: nXicleic acid 

(C) STRANDEDteSS: both 

(D) TOPOLOGY : NJ. inear 



55 



(ii) MOLECULE TYPE: cD* 



(ix) FEATURE: 

(A) NAME/KEY: misc_fe£ 

(B) LOCATION: 6 . . 64 

(D) OTHER INFORMATION: /hote- "Complementary 

double- stranded binding between bases 6 and 64 
with SEQ ID NO: 25. " 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0: v 
GTCACCGCAA ACAAACTGCA GAGCGTCAAC CAGTTCAGCA \ICGCACAGAG TTTCGGGACC 
TGCA 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



60 
64 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 5 . . 84 
(D) OTHER INFORMATION: /note= "Complementary 

double -stranded binding between bases 5 and 84 
with SEQ ID NO: 28. " 



-16- 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CT&GAATTAT gatgattact ctgcgcaaac ttcctctggc ggttgccgtc gcagcgggcg 

TAATCTCTGC TCAGGCCATG GCCA 

(2) I NF^RMAT I ON FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 
OO LENGTH: 84 base pairs 
(b\tYPE: nucleic acid 

(C) VrRANDEDNESS: both 

(D) TfctfPOLOGY: linear 

(ii) MOLECULeVyPE : cDNA 



60 
84 



(ix) FEATURE: 

(A) NAME/KEY: Nnisc_f eature 

(B) LOCATION: 5V . 84 
(D) OTHER INFORMATION: /note= "Complementary 

double- straWied binding between bases 5 and 84 
with SEQ ID NO: 27. ' 



(xi) SEQUENCE DESCRIPTION: SEQS 
GATCTGGCCA TGGCCTGAGC AG AC ATT AC G CC 
TTGCGCAGAG TAATCATCAT AATT 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



NO: 28: 

JTGCGA CGGCAACCGC CAGAGGAAGT 



60 
84 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1. .52 

(D) OTHER INFORMATION: /note= "Complementary 

double- stranded binding between bases 1 and 
with SEQ ID NO: 30. " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CAACTCTACA GTGCATTGGC TAATAAATGT TGCCATGTTG GTTGTACCAA AA 



(2) INFORMATION FOR SEQ ID NO: 30: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii)NMOLECULE TYPE: cDNA 



(ix) FEATURE : 

(A) \lAME/KEY: misc_f eature 

(B) LOCATION: 5. .56 

(D) OTP^R INFORMATION: /note= "Complementary 

>uble- stranded binding between bases 5 and 56 
wikh SEQ ID NO: 29." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GATCTTTTGG TACAACCAAC ATGGSAACAT TTATTAGCCA ATGCACTGTA GAGTTGTGCA 



60 



(2) INFORMATION FOR SEQ ID NO rSl : 

(i) SEQUENCE CHARACTERISTIC? 

(A) LENGTH: 13 amino aci\s 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Asp Lys Lys Arg Thr Gly Tyr Gly Ser Arg ArgNArg Lys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



Asp Lys Lys Arg Thr Gly Tyr Gly Ser Arg Lys Lys Arg 
15 10 
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>) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCk DESCRIPTION: SEQ ID NO: 33: 

Lys Arg Lys Pr\ Thr Gly Tyr Gly Ser Arg Arg Arg Lys 
1 \5 10 

(2) INFORMATION FOR SEQV[D NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 915 base pairs 

(B) TYPE: nucleic &cid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3. .452 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GC GGC CGC GAA TTA GTT CGC GCG CAG ATT GCC ATT 1SGC GGC ATG AGC 
Gly Arg Glu Leu Val Arg Ala Gin lie Ala lie Cy« Gly Met Ser 
1 5 10 \ 15 



47 



ACC TGG AGC AAA AGG TCT CTG AGC CAG GAA GAT GCT CCT SAG ACA CCT 
Thr Trp Ser Lys Arg Ser Leu Ser Gin Glu Asp Ala Pro Gin Thr Pro 
20 25 \ 30 



95 



AGA CCA GTG GCA GAA ATT GTG CCA TCC TTC ATC AAC AAA GAT At$A GAA 
Arg Pro Val Ala Glu lie Val Pro Ser Phe lie Asn Lys Asp Thr^ Glu 
35 40 45 



143 



ACC ATA AAT ATG ATG TCA GAA TTT GTT GCT AAT TTG CCA CAG GAG CT 
Thr lie Asn Met Met Ser Glu Phe Val Ala Asn Leu Pro Gin Glu Leu\ 
50 55 60 



191 



AAG TTA ACC CTG TCT GAG ATG CAG CCA GCA TTA CCA CAG CTA CAA CAA 
Lys Leu Thr Leu Ser Glu Met Gin Pro Ala Leu Pro Gin Leu Gin Gin 
65 70 75 



239 



CAT GTA CCT GTA TTA AAA GAT TCC AGT CTT CTC TTT GAA GAA TTT AAG 



187 
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His\Val Pro Val Leu Lys Asp Ser Ser Leu Leu Phe Glu Glu Phe Lys 
80 \ 85 90 95 

AAA CTT ATT CGC AAT AGA CAA AGT GAA GCC GCA GAC AGC AGT CCT TCA 335 
Lys Leu\Ile Arg Asn Arg Gin Ser Glu Ala Ala Asp Ser Ser Pro Ser 
100 105 110 

GAA TTA AAX TAC TTA GGC TTG GAT ACT CAT TCT CGA AAA AAG AGA CAA 383 
Glu Leu Lys \Tyr Leu Gly Leu Asp Thr His Ser Arg Lys Lys Arg Gin 
H5 120 125 

CTC TAC AGT GCA\TTG GCT AAT AAA TGT TGC CAT GTT GGT TGT ACC AAA 431 
Leu Tyr Ser Ala Deu Ala Asn Lys Cys Cys His Val Gly Cys Thr Lys 
130 \ 135 140 

AGA TCT CTT GCT AGA toT TGC TGAGATGAAG CTAATTGTGC ACATCTCGTA 482 
Arg Ser Leu Ala Arg PHte Cys 
145 \150 

TAATATTCAC AC AT ATT CTT AAT<3^CATTT CACTGATGCT TCTATCAGGT CAATTCTCAT 542 

GTTTGACAGC TTATCATCGA TAAGCT\TAA TGCGGTAGTT TATCACAGTT AAATTGCTAA 602 

CGCAGTCAGG CACCGTGTAT GAAATCTAA\ AATGCGCTCA TCGTCATCCT CGGCACCGTC 662 

ACCCTGGATG CTGTAGGCAT AGGCTTGGTT ^To6c#GTAC TGCCGGGCCT CTTGCGGGAT 722 

ATCGTCCATT CC GAC AGC AT CGCCAGTCAC TA^G^CGTGC TGCTAGCGCT ATATGCGTTG 782 

ATGCAATTTC TATGCGCACC CGTTCTCGGA GCACTG^CCG ACCGCTTTGG CCGCCGCCCA 842 

GTCCTGCTCG CTTCGCTACT TGGAGCCACT ATCGACTAC^ CGATCATGGC GACCACACCC 902 

GTCCTGTGGA TCC \ 915 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE-: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Gly Arg Glu Leu Val Arg Ala Gin lie Ala lie Cys Gly Met Ser Th\ 
1 5 10 15 

Trp Ser Lys Arg Ser Leu Ser Gin Glu Asp Ala Pro Gin Thr Pro Arg 
20 25 30 



Pro Val Ala Glu lie Val Pro Ser Phe lie Asn Lys Asp Thr Glu Thr 
35 40 45 
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\Le Asn Met Met Ser Glu Phe Val Ala Asn Leu Pro Gin Glu Leu Lys 
50 55 60 

Leu tin: Leu Ser Glu Met Gin Pro Ala Leu Pro Gin Leu Gin Gin His 
65 \ 70 75 80 

Val Pro V^l Leu Lys Asp Ser Ser Leu Leu Phe Glu Glu Phe Lys Lys 
85 90 95 

Leu lie Arg ^n Arg Gin Ser Glu Ala Ala Asp Ser Ser Pro Ser Glu 

105 110 

Leu Lys Tyr Leu £%y Leu Asp Thr His Ser Arg Lys Lys Arg Gin Leu 
115 \ 120 125 

Tyr Ser Ala Leu Ala Assn Lys Cys Cys His Val Gly Cys Thr Lys Arg 
130 \135 140 

Ser Leu Ala Arg Phe Cys 
145 150 

(2) INFORMATION FOR SEQ ID NO: 

(i) SEQUENCE CHARACTERISTICS^ 

(A) LENGTH: 301 base pair* 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7. .297 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



AAGCTT ATG AAA TCT AAC AAT GCG CTC ATC GTC ATC CTC GGC\ACC GTC 
Met Lys Ser Asn Asn Ala Leu lie Val lie Leu Gly \hr Val 
1 5 10 
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ACC CTG GAT GCT GTA GGC ATA GGC TTG GTT ATG CCG GTA CTG CC& GGC 
Thr Leu Asp Ala Val Gly lie Gly Leu Val Met Pro Val Leu ProVly 
15 20 25 \30 
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CTC TTG CGG GAT ATC GTC CAT TCC GAC AGC ATC GCC AGT CAC TAT 
Leu Leu Arg Asp lie Val His Ser Asp Ser lie Ala Ser His Tyr 
35 40 45 



Gly\ 
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GTG CTG CTA GCG CTA TAT GCG TTG ATG CAA TTT CTA TGC GCA CCC GTT 
Val Leu Leu Ala Leu Tyr Ala Leu Met Gin Phe Leu Cys Ala Pro Val 
50 55 60 



192 



CTC GGA GCA CTG TCC GAC CGC TTT GGC CGC CGC CCA GTC CTG CTC GCT 



240 



-21- 

Leu Gly Ala Leu\ Ser Asp Arg Phe Gly Arg Arg Pro Val Leu Leu Ala 
65 \ 70 75 



TCG CTA CTT GGA GC& ACT ATC GAC TAC GCG ATC ATG GCG ACC ACA CCC 
Ser Leu Leu Gly Ala Vhr lie Asp Tyr Ala lie Met Ala Thr Thr Pro 
80 \ 85 90 



288 



GTC CTG TGG ATCC 
Val Leu Trp 
95 



301 



(2) INFORMATION FOR SEQ ID NOW) : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 aminX acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ II 

Met Lys Ser Asn Asn Ala Leu lie Val lie' 
1 5 10 



:40: 

I)eu Gly Thr Val Thr Leu 
15 



Asp Ala Val Gly lie Gly Leu Val Met Pro Val Leu Pro Gly Leu Leu 
20 25 \ 30 

Arg Asp lie Val His Ser Asp Ser lie Ala Ser HisVTyr Gly Val Leu 
35 40 \45 

Leu Ala Leu Tyr Ala Leu Met Gin Phe Leu Cys Ala Pr\ Val Leu Gly 
50 55 60 

Ala Leu Ser Asp Arg Phe Gly Arg Arg Pro Val Leu Leu Ala Ser Leu 
65 70 75 \ 80 



Leu Gly Ala Thr lie Asp Tyr Ala lie Met Ala Thr Thr Pro Val Leu 
85 90 95 



Trp 



